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FIELD OF THE INVENTION 

The present invention concerns novel nucleic acid sequences, vectors and 
host cells containing them, amino acid sequences encoded by said sequences, and 
antibodies reactive with said amino acid sequences, as well as pharmaceutical 
5 compositions comprising any of the above. The present invention further concerns 
methods for screening for candidate activator or deactivators utilizing said amino 
acid sequences. — - 

BACKGROUND OF THE INVENTION 

Bone morphogenetic proteins (BMPs) are members of the TGS-p super 
L o family of peptides which have conserved carboxy-teriminal regions containing 
seven cysteine repeats. Multiple BMPs or osteogenic proteins (OPs) have been 
described including BMP-2, -3 (or osteogenin), -4 -5 -6 -7 (or OPI) and -8 (or 
OP2). BMPs are known for their role in embryonic development and differentiation 
and have also mitogenic properties for skeletal cells inducing the differentiation of 
15 mesenchymal cells into osteogenic cells as well as enhancement of the 
differentiation of osteoblast. They are also known to increase collagen synthesis 
and inhibit collagenase 3 expression by osteoblast. 

Chordin is a secreted glycoprotein with a molecular mass of 120 kD and is 
svnthesized by the Spemann organizer of the amphibian gastrula. Chordin mimics 
20 the action of the Spemann organizer which can induce the formation of neural 
tissue from ectoderm and induce dorsalization of the ventral mesoderm to form 
muscle. These two activities are opposed (i.e. antagonized) by BMPs and chordin 
binds and blocks the action of BMP-2 and BMP-4 by preventing receptor~binding. 
It has been postulated that chordin may be expressed by cells of the osteoblastic 



i:ne ,ce to limit BMP actions in the osteoblast. This would be a critical function tor 
a BMP hindino protein since excessive BMP-4 has been implicated in pathogenesis 
of fbrodv.piasia ossificans progressiva. It has been postulated that BMPs can 
cau<e induction of noggin and chordin mRNA and protein levels in skeletan cells 
bv ra.ncrirtional mechanisms, and in turn these prevent the effect of BMPs m 
osteoblast in a neeative-type feeback. The induction of these proteins by BMPs 
arrears to be a mechanism to limit the BMP effect in bones (Gazzerro et aL J. 
dir.. Invest.. 102(.12):2 106-2 114 (1993)). 



GLOSSARY . 

H the follow description and claims use will be made, at times, with a 

variety of terms, and the meaning of such terms as they should be construed m 
accordance with the invention is as follows: 

-Chordin like Homolog (CLH) nucleic acid seance" - the sequence shown in 
SEO ID NO 1 sequences having at least 70% identity ,o sa,d sequence and 
trasmems of UK above sequences oeutg 20b.p. iong. This sequence is a sequence 
coding for a novel homolog of the known Chordin proteul. 

"ihe novel CLH is a homolog to the known ehordms vv.thm the VWFC 
0 domatn. named after the von-Willebrand factor (VWF) type C repeat, whtchts 
found 2-4 tumes in these mulu-domatn proteins. The VWF dornam has a length of 
about 70 amino actds covertng 10 wel> conserved cysteines. The 
recion in complex-forming proteins leads ,0 the assumption that the VWFC 
domain might be involved in forming larger protein complexes. 

However, the term CLH does not necessarily signify that CLH protetn 
coded bv the above sequence has the same or even similar phystologtcal effects as 
known Chordte. mereW that it shows sequence homology wi* the known 
Chordin. 

a tn H nmduct) - also referred at times as the 
, 0 -Chordin like homology product (CLH product) J 

•^CLH protein" or 'CLH polypeptide" - is an ammo actd show, m SEQ i " 



The amino acid sequence may be a peptide,, a protein, as well as peptides or 
proteins having chemically modified amino acids (see below) such as a 
alycopeptide or glycoprotein. An example of an CLH product is shown in SEQ ID 
NO: 2. The term also includes analogues of said sequences in which one or more 
amino acids has been added, deleted, substituted (see below) or chemically 
modified (see below) as well as fragments of this sequence having at least 10 amino 
acids. 

"Nucleic acid sequence" - a sequence composed of DNA nucleotides, RNA 
nucleotides or a combination of both types and may includes natural nucleotides, 
chemicallv modified nucleotides and synthetic nucleotides. 

"Amino acid sequence" - a sequence composed of any one of the 20 naturally- 
appearing amino acids, amino acids which have been chemically modified (see 
below), or composed of synthetic amino acids. 

"Fragment of CLH product" - a polypeptide whicn has an amino acid sequence 
which is the same as part of but not all of the amino acid sequence of the CLH 
product. 

"Fragments of CLH nucleic acid sequence" a continuous portion, preferably of 
about 20 nucleic acid sequences of the CLH nucleic acid sequence. 

"Conservative substitution" - refers to the substitution of an amino acid in one 
class by an amino acid of the same class, where a class is defined by common 
physicochemical amino acid side chain properties and high substitution 
frequencies in homologous proteins found in nature, as determined, for example, 
by a standard Dayhoff frequency exchange matrix or BLOSUM matrix. [Six 
general classes of amino acid side chains have~been caJegorized" and include: 
Class I (Cys); Class II (Ser, Thr, Pro, Ala, Gly); Class III (Asn, Asp, Gin, Glu); 



Class IV (His. Arg. Lys): Class V die. Leu. Vai. Met): and Class VI <Phe. Tyr. 
Tr p ( _ p or example, substiaition of an Asp for another class III residue such as 
Asn. Gin. or Glu. is a conservative substitution. 

; "Son-conservative substitution" - refers to the substitution of an amino acid in 
one class with an amino acid from another class: for example, substitution of an 
Ala. a class II residue, with a class III residue such as Asp. Asn. Giu. or Gin. 



^Chemically modified" - when referring to the product of the invention, means a 
product (protein) where at least one of its amino acid resides is modified either by 
natural processes, such as processing or other post-translational modifications, or 
by chemical modification techniques which are well known in the art. Among 
the numerous known modifications typical, but not exclusive examples include: 
acetylation, acylation. amidation, ADP-ribosylation, glycosylation, GPI anchor 
formation, covalent attachment of a lipid or lipid derivative, methylation, 
nr ristlvation. Def lation, prenylation, phosphorylation, ubiqutination, or any 
similar process. 
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"Biologically active" - refers to the CLH product which have, regulatory or 
20 biochemical functions on the same target sites which naturally occurring CLH 
influence, for example can bind to the same receptor as the chordin (or to another 
receptor), can control BMPs expression, and regulate the amount of effective or 
biologically available BMP. In particular the CLH may have effects in osteoblasts 
as well as other cells of mesenchymal origin. 

"Immunologically active" defines the capability of a natural, recombinant or 
synthetic CLH product, or any fragment thereof, to induce a specific immune 
response in appropriate animals or cells and to bind with specific antibodies. 
Thus, for example, a biologically active fragment of CLH product denotes a 
30 fragment which retains some or all of the immunological properties of the CLH 



product, e.g can bind specific anti-CLH product antibodies or which can elicit an 
immune response which will generate such antibodies or cause proliferation of 
specific immune cells which produce CLH. 

"Optimal alignment" - is defined as an alignment giving the highest percent 
identity score. Such alignment can be performed using a variety of commercially 
available sequence analysis programs, such as the local alignment program 
L ALIGN using a ktup of L default parameters and the default PAM. A preferred 
alignment is the one performed using the CLUSTAL-W program from 
Mac Vector (TM). operated with an open gap penalty of 10. 0 ; an extended gap 
penalty of 0.1 ; and a BLOSUM similarity matrix. If a gap needs to be inserted 
into a first sequence "to optimally align it with a second sequence, the percent 
identity 7 is calculated using only the residues that are paired with a corresponding 
amino acid residue (i.e.. the calculation does not consider residues in the second 
sequences that are in the "gap" of the first sequence). 

"Having at least X% identity" - with respect to two amino acid or nucleic acid 
sequence sequences, refers to the percentage of residues that are identical in the 
two sequences when the sequences are optimally aligned. Thus. 70% amino acid 
sequence identity means that 70% of the amino acids in two or more optimally 
aligned polypeptide sequences are identical. 

"Isolated nucleic acid molecule having an CLH nucleic acid sequence" - is a 
nucleic acid molecule that includes the coding CLH nucleic acid sequence. Said 
isolated nucleic acid molecule may include the CLH nucleic acid sequence as an 
independent insert; may include the CLH nucleic acid sequence fused to an 
additional coding sequences, encoding together a fusion protein in which the 
CLH coding sequence is the dominant coding sequence (for ex ample, the 
additional coding sequence may code for a signal peptide); the CLH nucleic acid 
sequence may be in combination with non-coding sequences, e.g., introns or 
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control elements, such as promoter and terminator elements or 5' and or 3' 
untranslated regions, effective for expression of the coding sequence in a suitable 
host: or may be a vector in which the CLH protein coding sequence is a 
heterologous. 

"Expression vector" - refers to vectors that have the ability to incorporate and 
express heterologous DNA fragments in a foreign cell. Many prokaryotic and 
eukaryotic expression vectors are known and or commercially available. 
Selection of appropriate expression vectors is within the knowledge of those 
having skill in the an. 

"Deletion " - is a change in either nucleotide or amino acid sequence in which 
one or more nucleotides or amino acid residues, respectively, are absent. 

"Insertion" or "addition" - is that change in a nucleotide or amino acid 
sequence which has resulted in the addition of one or more nucleotides or amino 
acid residues, respectively, as compared to the naturally occurring sequence. 

"Substitution" - replacement of one or more nucleotides or amino acids by 
20 different nucleotides or amino .cids, respectively. As regards amino acid 
sequences the substitution may be conservative or non- conservative. 

-Antibody" - refers to IgG, IgM, IgD. IgA, and IgG antibody. The definition 
includes polyclonal antibodies or monoclonal antibodies. This term refers to 
25 whole antibodies or fragments of the antibodies comprising the antigen-binding 
domain of the anti-CLH product antibodies, e.g. antibodies without the Fc 
portion, single chain antibodies, fragments consisting of essentially only the 
variable, antigen-binding domain of the antibody, etc. 
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"Activator" - as used herein, refers to a molecule which mimics the effect of the 
natural CLH product or at times even increases or prolongs the duration of the 
biological activity of said product, as compared to that induced by the natural 
product. The mechanism may be by binding to the same receptor of target 
moieties to which native CLH binds thus mimicking the activity of CLH; by 
prolonging the lifetime of the CLH ? (for example by decrease of the rate of its 
degradation) bv increasing the activitv of the CLH on its target (modulation of 
expression and amount of BMPs), by increasing the affinity of CLH to moieties 
which it binds (such as its receptors) etc. Activators may be polypeptides; nucleic 
acids, carbohydrates, lipids, or derivatives thereof or any other molecules which 
can bind to and activate the CLH product. 

"Deactivator" or ("Inhibitor") - refers to a molecule which modulates the 
activity of the CLH product in an opposite manner to that of the activator, by 
decreasing or shortening the duration of the biological activity of the CLH 
product. This may be done by blocking the binding of the CLH to its receptor 
(competitive or non-competitive inhibition), by causing rapid degradation of the 
CLH ; etc. by inhibiting association of the CLH with the effectors which regulate 
the expression of BMPs, etc. Deactivators may be polypeptides, nucleic acids, 
carbohydrates, lipids, or derivatives thereof, or any other molecules which bind to 
and modulate the activity of said product. 

"Treating a disease" - refers to administering a therapeutic substance effective 
to ameliorate symptoms associated with a disease, to lessen the severity or cure 
the disease, or to prevent the disease from occurring. 

"Detection" - refers to a method of detection of a disease. This term may refer to 
detection of a predisposition to a disease. 



"Probe" - the CLH nucleic acid sequence, or a sequence complementary 
therewith, when used to detect presence of other similar sequences in a sample. 
The detection is carried out by identification of hybridization complexes between 
the probe and the assayed sequence. The probe may be attached to a solid support 
or to a detectable label. 

SUMMARY OF THE INVENTION 

The present invention is based on me surprising finding that there exist in 
humans a novel homolog of the chordin protein having a significant homology- to 

the chordin protein. 

The novel CLH is a homolog to the known chordins within the VWFC 
domain, named after the von-Willebrand factor (VWT) type C repeat, which is 
found 2-4 times in these multi-domain proteins. The VWF domain has a length of 
about 70 amino acids covering 10 well conserved cysteines. The presence of this 
region in complex-forming proteins leads to the assumption that the VWFC 
domain might be involved in forming larger protein complexes. 

Thus the present invention provides by its first aspect, a novel isolated 
nucleic acid molecule comprising or consisting of the sequence SEQ ID NO: 1. 
fragments of said sequence having at least 20 nucleic adds, or a molecule 
comprising a sequence having at least 70%, preferably 80%, and most preferably 
, 90% identity to SEQ ID NO:l. The present invention further provides a protein or 
polypeptide comprising or consisting of an amino acid sequence encoded by any of 
the' above nucleic acid sequences, termed herein "CLH product" , for example, an 
amino acid sequence having the sequence as depicted in SEQ ID NO: 2, fragments 
of the above amino acid sequence having a length of at least 10 amino acids, as 
5 well as homologs of the amino acid sequences SEQ TD NO.:2 in which one or mere 
of the amino acid residues has been substituted (by conservative or 
non-conservative substitution) added, deleted, or chemically modified. 

The present invention further provides nucleic acid molecule comprising or 
consisting of a sequence which encodes the above amino acid sequences. 



(including the fragments and analogs of the amino acid sequences). Due to the 
degenerative nature of the genetic code : a plurality of alternative nucleic acid 
sequences, beyond SEQ ID NO:L can code for the amino acid sequence of the 
invention. Those alternative nucleic acid sequences which code for the same amino 
acid sequences codes by the sequence SEQ ID NO: 1 are also an aspect of the of 

the present invention. 

The present invention further provides expression vectors and cloning 
vectors comprising any of the above nucleic acid sequences, as well as host cells 

transfected by said vectors. 

The present invention still further provides pharmaceutical compositions 
comprising, as an active ingredient, said nucleic acid molecules, said expression 
vectors, or said protein or polypeptide. 

These pharmaceutical compositions are suitable for the treatment of diseases 
and pathological conditions, which can be ameliorated or cured or prevented by 
raising the level of the CLH product. Typically these are diseases which are 
manifested by non-normal levels of various BMP proteins (which can be higher or 
lower than normal levels) which can lead to pathological conditions associated with 
osteoblasts (such as fibrodysplasia ossificans progressiva) or of other tissues of 
mesenchymal origin. The compositions are intended to restore the BMP levels to 
normal levels or to restore the effect of existing levels of BMPs to normal effect.. 

By a second aspect, the present invention provides a nucleic acid molecule 
comprising or consisting of a non-coding sequence which is complementary to that 
of SEQ ID NO:l, or complementary to a sequence having at least 70% identity to 
said sequence or a fragment of said two sequences. The complementary sequence 
may be a DNA sequence which hybridizes with the SEQ of ID NO:l or hybridizes 
to a portion of that sequence having a length sufficient to inhibit the transcription of 
the complementary sequence. The complementary sequence may be a DNA 
sequence which can be transcribed into an m RNA being an antisense t o the mRNA 
transcribed from SEQ ID NO:l or into an mRNA which is an antisense to a 
fragment of the mRNA transcribed from SEQ ID NO.:l which has a length 



sufficient 10 hybridize with the mRNA transcribed from SEQ ID NO:. 1. so as to 
inhibit its translation. The complementary sequence may also be the mRNA or the 
fragment of the mRNA itself 

The nucleic acids of the second aspect of the invention may be used for 
therapeutic or diagnostic applications for example for detection of the expression of 
CLH m various tissues such as bones and other tissue of mesenchymal origin. 

The present invention also provides expression vectors comprising any one 
of the above defined complementary nucleic acid sequences and host cells 
trans fected with said nucleic acid sequences or vectors, being complementary to 
those specified in the first aspect of the invention. 

The invention also provides anti-CLH product antibodies, namely antibodies 
directed against the" CLH product which specifically bind to said CLH product. 
Said antibodies are useful both for diagnostic and therapeutic purposes. For 
example said antibody may be as an active ingredient in a pharmaceutical 

: composition as will be explained below. 

The present invention also provides pharmaceutical compositions 
comprising, as an active ingredient, the nucleic acid molecules which comprise or 
consist of said complementary sequences, or of a vector comprising said 
complementary sequences. The pharmaceutical composition thus provides 

0 pharmaceutical compositions comprising, as an active ingredient, said anti-CLH 
product antibodies. 

The pharmaceutical compositions comprising said anti-CLH product 
antibodies or the nucleic acid molecule comprising said complementary sequence, 
are suitable for the treatment of diseases and pathological conditions where a 

.5 therapeutically beneficial effect may be achieved by neutralizing the CLH or 
decreasing the amount of the CLH product or blocking its binding to its target (for 
example its receptor), for example, by the neutralizing effect of the antibodies, or 
bv the decrease of the effect of the antisense mRNA in decreasing expression level 
of the CLH product. Mostly these diseases are manifested by non-normal levels of 

30 BMPs in the diseased persons, or by non-normal effect of BMPs (even at regular 
levels) on their targeting cells. Said non-normal effect is usually higher than normal 
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levels, mav be down-resulated to Droduce normal levels, bv utilizina the 

pharmaceutical compositions of the invention. 

According to the third aspect of the invention the present invention provides 
methods for detecting the level of the transcript (mRNA) of said CLH product in a 
5 body fluid sample, or in a specific tissue sample (notably bone tissue), for example 
by use of probes comprising or consisting of said coding sequences; as well as 
methods for detecting levels of expression of said product in tissue, e.g. by the use 
of antibodies capable of specifically reacting with the above amino acid sequences. 
Detection of the level of the expression of the CLH of the invention may be 
10 indicative of a plurality of physiological or pathological conditions. 

The method, according to this latter aspect, for detection of a nucleic acid 
sequence which encodes the CLH product in a biological sample, comprises the 
steps of: 

(a) providing a probe comprising at least one of the nucleic acid 
15 sequence defined above; 

(b) contacting the biological sample with said probe under conditions 
allowing hybridization of nucleic acid sequences thereby enabling formation of 
hybridization complexes; 

(c) detecting hybridization complexes, wherein the presence of the 
20 complex indicates the presence of nucleic acid sequence encoding the CLH product 

in the biological sample. 

The amount of hybridization complexes may be determined and calibrated 

by comparing it to a calibration scale in order to determine the amount of the 

nucleic acid sequence which enables the CLH product in the sample. 
25 By a preferred embodiment the probe is part of a nucleic acid chip used for 

detection purposes, i.e. the probe is a part of an array of probes each present in a 

known location on a solid support. 

The nucleic acid sequence used in the above method may be a DNA 

sequence an RNA sequence, etc; it may be a coding-or~a-sequenee-or a sequence 
30 complementary thereto (for respective detection of RNA transcripts or 



codine-DNA sequences). By quantization of the level of hybridization complexes 
and calibrating the quantified results it is possible also to detect the level of the 

transcript in the sample. 

Methods for detecting mutations in the region coding for the CLH product 
are also provided, which may be methods camed-out in a binary fashion, namely 
merelv detecting whether there is any mismatches between the normal CLH nucleic 
acid sequence and the one present in the sample, or camed-out by specifically 
detectine the nature and location of the mutation. 

The present invention also concerns a method for detecting CLH product 
both for determining its presence, as well as its level or alterations in its level in a 
biological sample, comprising the steps of: 

(a) contacting with said biological sample the antibody of the invention. 

therebv forming an "antibody-antigen complex: and 

(b) detecting said antibody-antigen complex 

; wherein the presence of said antibody-antigen complex correlates with the 

presence of CLH product in said biological sample. 

The present invention also concerns a method for detecting anti-CLH 
antibodies in a biological sample comprising the steps or: 

(a) contacting said biological sample with the product of the invention 
o thereby forming an antibody-antigen complex; and 

(b) detecting said antibody-antigen complex 

wherein the presence of said antibody-antigen complex correlates with the 
presence of anti-CHL antibody in said biological sample. 

In many cases, diseases are detected not by detecting the presence of the 
_ 5 protein (product) which caused the disease, but rather by detecting the presence in a 
biological sample (such as blood or serum) of antibodies against such a product. 
The method of detecting the presence of anti-CLH antibodies is intended to be used 
in such case. 

The amount of the antibody-antigen complex can be quantitized, in order 10 
30 determine the level of the CHL-product or the anti-CHL antibodies, as the case 
mav be. 



By yet another aspect the invention also provides a method for identifying 
candidate compounds capable of modulating the activity of CLH product (being 
either activators or deactivators). The method includes: 

(i) providing a protein or polypeptide comprising an amino acid 
sequence substantially as depicted in SEQ ID NO: 2. or a fragment of such a 
sequence; 

(ii) contacting a candidate compound with said amino acid sequence; 
(in) comprising the physiological effect of the amino acid sequence in the 

presence and absence of said candidate compound and selecting those compounds 
which show a significant effect on said physiological activity. 

The activity of the amino acid which should be changed by the modulator 
(being either the activator or deactivator) may be for example the binding of the 
CLH product to itSTreceptor, the effect of CLH on BMPs expression or activity. 
Anv modulator which changes such an activity has an infecting potential, as 

servins as an actuator or deactivator. 

> — - 

The present invention also concerns compounds identified by the above 
methods described above, which compound may either be an activator of the CLH 
oroduct or a deactivator thereof. 

BRIEF DESCRIPTION OF THE DRAWINGS 

In order to understand the invention and to see how it may be carried out in 
practice, a preferred embodiment will now be described, by way of non- limiting 
example only, with reference to the accompanying drawings, in which: 

Fig. 1 is a demonstration of the amino acid sequence depicted in SEQ ED 
NO: 2 coded by the nucleic acid sequence ED NO: 1 ; 

Fig. 2 is alignment of the CLH peptide to known chordin protein, 
demonstrating the homology regions within these proteins. The aignment was 
performed using best- fit of GCG. 



DETAILED DESCRIPTION OF A PREFERRED EMBODIMENT 

Example I: CLH - nucleic acid sequence 

The nucleic acid sequences of the invention include nucleic acid 
sequences which encode CLH product and fragments and analogs thereof. The 
nucleic acid sequences may alternatively be sequences complementary to the 
above coding sequence, or to a region of said coding sequence. The length of the 
complementary sequence is sufficient to avoid the expression of the coding 
sequence. The nucleic acid sequences may be in the form of RNA or in the form 
of DNA. and include messenger RNA. synthetic RNA and DNA. cDNA. and 
genomic DNA. The DNA may be double-stranded or single-stranded, and if 
sinsle-stranded may be the coding strand or the non-coding (anti-sense, 
complementary) strand. The nucleic acid sequences may also both include 
dNTPs. rNTPs as well as non naturally occurring sequences. The sequence may 
also be a part of a hybrid between an amino acid sequence and a nucleic acid 
seauence. 

In a general embodiment, the nucleic acid sequence has at least 70%, 
preferably 80% or 90% sequence identity with the sequence identified as SEQ ID 
NOT. 

The nucleic acid sequences may include the coding sequence by itself. By 
another alternative the coding region may be in combination with additional 
coding sequences, such as those coding for fusion protein or signal peptides, in 
combination with non-coding sequences, such as introns and control elements, 
promoter and terminator elements or 5' and/or 3 1 untranslated regions, effective 
for expression of the coding sequence in a suitable host, and/or in a vector or host 
environment in which the CLH nucleic acid sequence is introduced as a 

heterologous sequence. 

The nucleic acid sequences of the present invention may also have the 
product coding sequence fused in-frame to a marker sequence which allows for 
punfication of the CLH product. The marker sequence may be, for example, a 



hexahistidine tag to provide for purification of the mature polypeptide fused to 
the marker in the case of a bacterial host or. the marker sequence may be a 
hemagglutinin (HA) tag when a mammalian host, e.g. COS-7 cells, is used. The 
HA tag corresponds to an epitope derived from the influenza hemagglutinin 
protein (Wilson, L s et al Cell 31:167 (1984)). 

Also included in the scope of the invention are fragments also referred to 
herein as oligonucleotides, typically having at least 20 bases, preferably 20-30 
bases corresponding to a region of the coding-sequence nucleic acid sequence. 
The fragments may be used as probes, primers, and when complementary also as 
antisense agents, and the like, according to known methods. 

As indicated above, the nucleic acid sequence may be substantially a 
depicted in SEQ ID"NO:l or fragments thereof or sequences having at least 70%, 
preferably 70-80%, most preferably 90% identity to the above sequence. 
Alternatively, due to the degenerative nature of the genetic code, the sequence 
may be a sequence coding the amino acid sequence of SEQ ID NO:2, or 
fragments or analogs of said amino acid sequence. 

A. Preparation of nucleic acid sequences 

The nucleic acid sequences may be obtained by screening cDNA libraries 
using oligonucleotide probes which can hybridize to or PCR-amplify nucleic acid 
sequences which encode the CLH products disclosed above. cDNA libraries 
prepared from a variety of tissues are commercially available and procedures for 
screening and isolating cDNA clones are well-known to those of skill in the art. 
Such techniques are described in, for example, Sambrook et al (1989) Molecular 
Cloning: A Laboratory Manual (2nd Edition), Cold Spring Harbor Press, 
Plainview, N.Y. and Ausubel FM et al. (1989) Current Protocols in Molecular 
Biology, John Wiley & Sons, New York, N.Y. 

The nucleic acid sequences may be exte nded to obtain u pstream and 
downstream sequences such as promoters, regulatory elements, and 5' and 3' 
untranslated regions (UTRs). Extension of the available transcript sequence may 



be performed by numerous methods known to those of skill in the an. such as 
PCR or primer extension < Sambrook et ai. supra), or by the RACE method 
usins. for example, the Marathon RACE kit I Clomech. Cat. = K 1802-1). 

Alternatively, the technique of "restriction-site" PCR (Gobinda et ai PCR 
Metnods Avviic. 2:318-22. (1993)1 which uses universal primers to retrieve 
fiankins sequence adjacent a known locus, may be employed. First, genomic 
DNA is amplified in the presence of primer to a linker sequence and a primer 
specific to the known region. The amplified sequences are subjected to a second 
round of PCR with the same linker primer and another specific primer internal to 
the first one. Products of each round of PCR are transcribed with an appropriate 
RNA polymerase and sequenced using reverse transcriptase. 

Inverse PCR can be used to amplify or extend sequences using divergent 
nrimers based on a known region (Triglia. T. et ai. Nucleic Acids Res. 16:8186. 
(1988)). The primers may be designed using OLIGO(R) 4.06 Primer .Analysis 
Software (1992; National Biosciences Inc. Plymouth, Minn.), or another 
appropriate program, to be 22-30 nucleotides in length, to have a GC content of 
50% or more, and to anneal to the target sequence at temperatures about 68-72°C. 
The method uses several restriction enzymes to generate a suitable fragment in 
the known region of a gene. The fragment is then circularized by intramolecular 
ligation and used as a PCR template. 

Capture PCR (Lagerstrom, M. et ai, PCR Methods Applic. 1:111-19, 
(1991)) is a method for PCR amplification of DNA fragments adjacent to a 
known sequence in human and yeast artificial chromosome DNA. Capture PCR 
also requires multiple restriction enzyme digestions and ligations to place an 
engineered double-stranded sequence into a flanking part of the DNA molecule 
before PCR. 

Another method which may be used to retrieve flanking sequences is that 
f Parker. J.D.. et ai, Nucleic Acids Res., 19:3055-60, (1991)). Additionally, one 
PCR. nested primers and PromoterFinder™ libraries to "walk in" genomic 
DNA (PromoterFinder™: Clontech, Palo Alto, CA). This process avoids the need 



o: 

can use 



to screen libraries and is useful in finding intron/exon junctions. Preferred 
libraries for screenina for full length cDNAs are ones that have been size-selected 
to include larger cDNAs. Also, random primed libraries are preferred in that they 
will contain more sequences which contain the 5* and upstream regions of genes. 

A randomly primed library may be particularly useful if an oligo d(T) 
librarv does not vield a full-leneth cDNA. Genomic libraries are useful for 
extension into the 5' nontranslated resulatorv resion. 

The nucleic acid sequences and oligonucleotides of the invention can also 
be prepared by solid-phase methods, according to known synthetic methods. 
Typically, fragments of up to about 100 bases are individually synthesized, then 
joined to form continuous sequences up to several hundred bases. 

B. Use of CLH nucleic acid sequence for the production of CLH products 

In accordance with the present invention, nucleic acid sequences specified 
above may be used as recombinant DNA molecules that direct the expression of 
CLH products. 

As will be understood by those of skill in the art, it may be advantageous 
to produce CLH product-encoding nucleotide sequences possessing codons other 
than those which appear in SEQ ID NO:l which are those which naturally occur 
in the human genome. Codons preferred by a particular prokaryotic or 
eukaryotiCLHost (Murray, E. et al Nuc Acids Res., 17:477-508, (1989)) can be 
selected, for example, to increase the rate of CLH product expression or to 
produce recombinant RNA transcripts having desirable properties, such as a 
longer half-life, than transcripts produced from naturally occurring sequence. 

The nucleic acid sequences of the present invention can be engineered in 
order to alter a CLH product coding sequence for a variety of reasons, including 
but not limited to, alterations which modify the cloning , proces sing and/or 
expression of the product. For example, alterations may be introduced using 
techniques which are well known in the art, e.g., site-directed mutagenesis, to 



insen new restriction sues, to alter glycosylate patterns, to change codon 
preference, to produce splice variants, etc. 

The present invention also includes recombinant constructs comprising 
one or more of the sequences as broadly described above. The constructs 
comprise a vector, such as a plasmid or viral vector, into which a nucleic acid 
sequence of the invention has been inserted, in a forward or reverse orientation. 
In a preferred aspect of this embodiment, the construct further comprises 
regulatory sequences, including, for example, a promoter, operably linked to the 
sequence. Large numbers of suitable vectors and promoters are known to those 
of skill in the art. and are commercially available. Appropriate cloning and 
expression vectors for use with prokaryotic and eukaryotiCLHosts are also 

described in Sambrook, era/., (supra). 

The present invention also relates to host cells which are genetically 
eneineered with vectors of the invention, and the product.on of the product of the 
invention bv recombinant techniques. Host cells are genetically engineered (i.e.. 
transduced, transformed or transfected) with the vectors of this invention which 
mav be. for example, a cloning vector or an expression vector. The vector may 
be.' for example, in the form of a plasmid. a viral particle, a phage, etc. The 
entered host cells can be cultured in conventional nutrient media modified as 
, appropriate for activating promoters, selecting transformants or amplifying the 
expression of the CLH nucleic acid sequence. The culture conditions, such as 
temperature. P H and the like, are those previously used with the host cell selected 
for expression, and will be apparent to those skilled in the art. 

The nucleic acid sequences of the present invention may be included in 
„ anv one of a variety of expression vectors for expressing a product. Such vectors 
include chromosomal, nonchromosomal and synthetic DNA sequences, e.g., 
derivatives of SV40: bacterial plasmids: phage DNA; baculovirus; yeast 
plasmids: vectors derived from combinations of plasmids and phage DNA, viral 
DNA such as vaccinia, adenovirus, fowl pox virus, and pseudorabies. However. 
, 0 anv other vector may be used as long as it is replicable and viable in the host. 
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Th e appropriate DNA sequence may be inserted into the vector by a variety- of 
procedures. In general, the DNA sequence is inserted into an appropriate 
restriction endonuclease site(s) by procedures known in the an. Such procedures 
and related sub-cloning procedures are deemed to be within the scope of those 
5 skilled in the art. 

The DNA sequence in the expression vector is operatively linked to an 
appropriate transcription control sequence (promoter) to direct mRNA synthesis. 
Examples of such promoters include: LTR or SV40 promoter, the E.coli lac or 
trp promoter, the phage lambda PL promoter, and other promoters known to 
10 control expression of genes in prokaryotic or eukaryotic cells or their viruses. 
The expression vector also contains a ribosome binding site for translation 
initiation, and a -transcription terminator. The vector may also include 
appropriate sequences for amplifying expression. In addition, the expression 
vectors preferably contain one or more selectable marker genes to provide a 
15 phenotypic trait for selection of transformed host cells such as dihydrofolate 
reductase or neomycin resistance for eukaryotic cell culture, or such as 
tetracycline or ampicillin resistance in E.coli. 

The vector containing the appropriate DNA sequence as described above, 
as well as an appropriate promoter or control sequence, may be employed to 
20 transform an appropriate host to permit the host to express the protein. Examples 
of appropriate expression hosts include: bacterial cells, such as E.coli, 
Streptomyces, Salmonella typhimurium: fungal cells, such as yeast; insect cells 
such as Drosophila and Spodoptera Sf9; animal cells such as CHO, COS, HEK 
293 or Bowes melanoma; adenoviruses: plant cells, etc. The selection of an 
25 appropriate host is deemed to be within the scope of those skilled in the art from 
the teachings herein. The invention is not limited by the host cells employed. 

In bacterial systems, a number of expression vectors may be selected 
depending upon the use intended for the CLH produ_ct,_Fpx^xample 3 when large 
quantities of CLH product are needed for the induction of antibodies, vectors 
30 which direct high level expression of fusion proteins that are readily purified may 



be desirable. Such vectors include, but are not limited to. multifunctional E.coli 
clonins and expression vectors such as BluescripttR) (Stratagene). in which the 
CLH polvpeptide coding sequence may be ligated into the vector in-frame with 
sequences for the amino-terminal Met and the subsequent " residues of 
beta-galactosidase so that a hybrid protein is produced: pIS vectors (Van Heeke 
& Schuster J. Biol. Chem. 264:5503-5509. (1989)); pET vectors (Novagen. 
Madison WI); and the like. 

In the veast Saccharomvces cerevisiae a number of vectors containing 
.constitutive or inducible promoters such as alpha factor, alcohol oxidase and 
PGH mav be used. For reviews, see Ausubel et al (supra; and Grant et al. 
{Methods in Enzymologv 153 :516-544, (1987)). 

In cases where plant expression vectors are used, the expression of a 
sequence encoding CLH product may be driven by any of a number of promoters. 
For example, viral promoters such as the 35S and 19S promoters of CaMV 
(Brisson et al, Nature 310:51 1-514. (1984)) may be used alone or in combination 
with the omega leader sequence from TMV (Takamatsu et al, EMBO 
6:307-311, (1987)). Alternatively, plant promoters such as the small subunit of 
RUBISCO (Coruzzi et al, EMBO 1 3:1671-1680, (1984); Broglie et al. Science 
224:838-843, (1984)); or heat shock promoters (Winter J and Sinibaldi R.M., 
Results Probl Cell Differ., 12:85-105. (1991)) may be used. These constructs 
can be introduced into plant cells by direct DNA transformation or 
pathogen-mediated transfection. For reviews of such techniques, see Hobbs S. or 
Murry L.E. (1992) in McGraw Hill Yearbook of Science and Technology, 
McGraw Hill, New York, N.Y., pp 191-196: or Weissbach and Weissbach (1988) 
Methods for Plant Molecular Biology, Academic Press, New York, N.Y., pp 
421-463. 

CLH product may also be expressed in an insect system. In one such 
system. Autographa californica nuclear polyhedrosis virus (AcNPV) is used as a 
vector to express foreign genes in Spodoptera frugiperda cells or in Trichoplusia 
larvae. The CLH product coding sequence may be cloned into a nonessential 
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region of the virus, such as the polyhedrin gene, and placed under control of the 
polyhedrin promoter. Successful insertion of CLH coding sequence will render 
the polyhedrin gene inactive and produce recombinant virus lacking coat protein 
coat. The recombinant viruses are then used to infect S. frugiperda cells or 
5 Trichoplusia larvae in which CLH protein is expressed (Smith et aL, J. Virol. 
46:584. (1983); Engelhard, EX. et aL, Proc. Nat. Acad. Set 91:3224-7. (1994)). 

In mammalian host cells, a number of viral-based expression systems may 
be utilized. In cases where an adenovirus is used as an expression vector, a CLH 
product coding sequence may be ligated into an adenovirus 

10 transcription/translation complex consisting of the late promoter and tripartite 
leader sequence. Insertion in a nonessential El or E3 region of the viral genome 
will result in a viaTjle virus capable of expressing CLH protein in infected host 
cells (Logan and Shenk, Proc, Natl. Acad. Set 81:3655-59, (1984). In addition, 
transcription enhancers, such as the Rous sarcoma virus (RSV) enhancer, may be 

15 used to increase expression in mammalian host cells. 

Specific initiation signals may also be required for efficient translation of 
a CLH protein coding sequence. These signals include the ATG initiation codon 
and adjacent sequences. In cases where CLH product coding sequence, its 
initiation codon and upstream sequences are inserted into the appropriate 

20 expression vector, no additional translational control signals may be needed. 
However, in cases where only coding sequence, or a portion thereof, is inserted, 
exogenous transcriptional control signals including the ATG initiation codon 
must be provided. Furthermore, the initiation codon must be in the correct 
reading frame to ensure transcription of the entire insert. Exogenous 

25 transcriptional elements and initiation codons can be of various origins, both 
natural and synthetic. The efficiency of expression may be enhanced by the 
inclusion of enhancers appropriate to the cell system in use (Scharf, D. et aL, 
(1994) Results ProbL Cell Differ., 20:125-62, (l994)^it&or-et^l^Methods in 
Enzymol 153:516-544, (1987)). 



In a farther embodiment, the present invention relates to host cells 
containing the above-described constructs. The host cell can be a higher 
eukarvotic cell, such as a mammalian cell, or a lower eukaryotic cell, such as a 
veast cell, or the host cell can be a prokaryotic cell, such as a bacterial cell. 

5 Introduction of the construct into the host cell can be effected by calcium 
phosphate transfection. DEAE-Dextran mediated transfection. or electroporation 
(Davis. L.. Dibner. M.. and Battey. I. (1986) Basic Methods in Molecular 
Biology). Cell-free translation systems can also be employed to produce 
polypeptides using RNAs derived from the DNA constructs of the present 

10 invention. 

A host cell strain may be chosen for its ability to modulate the expression 
of the inserted sequences or to process the expressed protein in the desired 
fashion. Such modifications of the protein include, but are not limited to, 
acetylation, carboxylation. glycosylation, phosphorylation, lipidation and 

15 acylation. Post-translational processing which cleaves a "pre-pro" form of the 
protein may also be important for correct insertion, folding and/or function. 
Different host cells such as CHO, HeLa ? MDCK ? 293. WIS 8, etc. have specific 
cellular machinery and characteristic mechanisms for such post-translational 
activities and may be chosen to ensure the correct modification and processing of 

20 the introduced, foreign protein. 

For long-term, high-yield production of recombinant proteins, stable 
expression is preferred. For example, cell lines which stably express CLH 
product may be transformed using expression vectors which contain viral origins 
of replication or endogenous expression elements and a selectable marker gene. 

25 Following the introduction of the vector, cells may be allowed to grow for 1-2 
davs in an enriched media before they are switched to selective media. The 
purpose of the selectable marker is to confer resistance to selection, and its 
presence allows growth and recovery of cells which successfully express the 
introduced sequences. Resistant clumps of stably transformed cells can be 

30 proliferated using tissue culture techniques appropriate to the cell type. 



Anv number of selection svstems mav be used to recover transformed cell 
lines. These include, but are not limited to. the herpes simplex virus thymidine 
kinase (Wigler M., et al : Cell 11:223-32, (1977)) and adenine 
phosphoribosyltransferase (Lo\vy I., et al T Cell 22:817-23. (1980)) genes which 
can be employed in tk- or aprt- cells, respectively. Also, antimetabolite, 
antibiotic or herbicide resistance can be used as the basis for selection: for 
example, dhfr which confers resistance to methotrexate (Wigler M.. et al, Proc. 
Natl Acad. ScL 77:3567-70. (1980)); npt, which confers resistance to the 
aminoglycosides neomycin and G-418 (Colbere-Garapin, F. et al 3 J. Mol Biol, 
150 :1-14, (1981)) and ah or pat, which confer resistance to chlorsulfuron and 
phosphinotricin acetyltransferase, respectively (Murry, supra). Additional 
selectable genes have- been described, for example. trpB. which allows cells to 
utilize indole in place of tryptophan, or hisD, which allows cells to utilize histinol 
in place of histidine (Hartman S.C. and R.C. Mulligan. Proc. Natl Acad. ScL 
85:8047-51, (1988)). The use of visible markers has gained popularity with such 
markers as anthocyanins. beta-glucuronidase and its substrate, GUS, and 
luciferase and its substrates, luciferin and ATP, being widely used not only to 
identify transformants, but also to quantify the amount of transient or stable 
protein expression attributable to a specific vector system (Rhodes, C A. et. al, 
Methods Mol Biol, 55:121-131, (1995)). 

Host cells transformed with a nucleotide sequence encoding CLH product 
may be cultured under conditions suitable for the expression and recovery of the 
encoded protein from cell culture. The product produced by a recombinant cell 
may be secreted or contained intracellularly depending on the sequence and/or the 
vector used. As will be understood by those of skill in the art, expression vectors 
containing nucleic acid sequences encoding CLH product can be designed with 
signal sequences which direct secretion of CLH product through a prokaryotic or 

eukaryotic cell membrane. 

CLH product may also be expressed as a recombinant protein with one or 
more additional polypeptide domains added to facilitate protein purification. 



Such purification facilitating domains include, but are not limited to. metal 
chelating peptides such as histidine-tryptophan modules that allow purification on 
immobilized metals, protein A domains that allow purification on immobilized 
immunoglobulin, and the domain utilized in the FLAGS extension affinity 

5 purification system (Immunex Corp. Seattle. Wash.). The inclusion of a 
protease-cleavable polypeptide linker sequence between the purification domain 
and CLH protein is useful to facilitate purification. One such expression vector 
provides for expression of a fusion procein compromising a CLH polypeptide 
fused to a polyhistidine region separated by an enterokinase cleavage site. The 

10 histidine residues facilitate purification on IMIAC (immobilized metal ion 
affinity chromatography, as described in Porath. et aL. Protein Expression and 
Purification, 3:263~-"281. (1992)) while the enterokinase cleavage site provides a 
means for isolating CLH polypeptide from the fusion protein. pGEX vectors 
(Promega, Madison, Wis.) may also be used to express foreign polypeptides as 

15 fusion proteins with glutathione S-transferase (GST), in general, such fusion 
proteins are soluble and can easily be purified from lysed cells by adsorption to 
ligand-agarose beads (e.g.. glutathione-agarose in the case of GST-fusions) 
followed by elution in the presence of free ligand. 

Following transformation of a suitable host strain and growth of the host 

20 strain to an appropriate cell density, the selected promoter is induced by 
appropriate means (e.g.. temperature shift or chemical induction) and cells are 
cultured for an additional period. Cells are typically harvested by centrifugation, 
disrupted by physical or chemical means, and the resulting crude extract retained 
for further purification. Microbial cells employed in expression of proteins can 

25 be disrupted by any convenient method, including freeze-thaw cycling, 
sonication, mechanical disruption, or use of cell lysing agents, or other methods, 
which are well know* to those skilled in the art. 

The CLH products can be recovered and purified from recombinant cell 
cultures bv anv of a number of methods well known in the art. including 

30 ammonium sulfate or ethanol precipitation, acid extraction, anion or cation 
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exchange chromatography, phosphocellulose chromatography, hydrophobic 
interaction chromatography. affinity chromatography. hydroxylapatite 
chromatography, and lectin chromatography. Protein refolding steps can be used, 
as necessary, in completing configuration of the mature protein. Finally, high 
5 performance liquid chromatography (HPLC) can be employed for final 
purification steps. 

C. Diagnostic applications utilizing nucleic acid sequences 

v The nucleic acid sequences of the present invention may be used for a 

10 variety of diagnostic purposes. The nucleic acid sequences may be used to detect 
and quantitate expression of CLH in patient's cells, e.g. biopsied tissues, by 
detecting the presence of mRNA coding for CLH product. Alternatively, the 
assav mav be used to detect soluble CLH in the serum or blood. This assay 
typically involves obtaining total mRNA from the tissue or serum and contacting 

15 the mRNA with a nucleic acid probe. The probe is a nucleic acid molecule of at 
least 20 nucleotides, preferably 20-30 nucleotides, capable of specifically 
hybridizing with a sequence included within the sequence of a nucleic acid 
molecule encoding CLH under hybridizing conditions, detecting the presence of 
mRNA hybridized to the probe, and thereby detecting the expression of CLH. 

20 This assay can be used to distinguish between absence, presence, and excess 
expression of CLH product and to monitor levels of CLH expression during 
therapeutic intervention. 

The invention also contemplates the use of the nucleic acid sequences as a 
diagnostic for diseases resulting from inherited defective CLH sequences. These 

25 sequences can be detected by comparing the sequences of the defective (i.e., 
mutant) CLH coding region with that of a normal coding region. Association of 
the sequence coding for mutant CLH product with abnormal CLH product 
activity may be verified. In addition, sequences""encoxiingTnutant-CbH products 
can be inserted into a suitable vector for expression in a functional assay system 

30 (e.g., colorimetric assay, complementation experiments in a CLH protein 
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dencient strain of HEK293 cells) as yet another means to verify or identify 
mutations. Once mutant genes have been identified, one can then screen 
populations of interest for carriers of the mutant gene. 

Individuals carrying mutations in the nucleic acid sequence of the present 

5 invention may he detected at the DNA level by a variety of techniques. Nucleic 
acids used for diagnosis may be obtained from a patient's cells, including but not 
limited to such as from blood, urine, saliva, placenta, tissue biopsy and autopsy 
material. Genomic DNA mav be used directlv for detection or mav be amplified 
enzymatically by using PCR (Saiki. et aL, Nature 324 :163-166. (1986)) prior to 

io analysis. RNA or cDNA may also be used for the same purpose. As an example. 
PCR primers complementary to the nucleic acid of the present invention can be 
used to identify and analyze mutations in the gene of the present invention. 
Deletions and insertions can be detected by a change in size of the amplified 
product in comparison to the normal genotype. 

15 Point mutations can be identified by hybridizing amplified DNA to 

radiolabeled RNA of the invention or alternatively, radiolabeled antisense DNA 
sequences of the invention. Sequence changes at specific locations may also be 
revealed by nuclease protection assays, such RNase and SI protection or the 
chemical cleavase method (e.e. Cotton, et alProc. Natl Acad. ScL USA. 

20 85:4397-4401, (1985)), or by differences in melting temperatures. "Molecular 
beacons" (Kostrikis L.G. et aL, Science 279:1228-1229, (1998)), hairpin-shaped, 
single-stranded synthetic oligo- nucleotides containing probe sequences which 
are complementary to the nucleic acid of the present invention, may also be used 
to detect point mutations or other sequence changes as well as monitor expression 

25 levels of CLH product. Such diagnostics would be particularly useful for 
prenatal testing. 

Another method for detecting mutations uses two DNA probes which are 
designed to hybridize to adjacent regions of a target, with abutting bases, where 
the region of known or suspected mutation(s) is at or near the abutting bases. 
30 The two probes may be joined at the abutting bases, e.g., in the presence of a 
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ligase enzyme, but only if both probes are correctly base paired in the region of 
probe junction. The presence or absence of mutations is then detectable by the 
presence or absence of ligated probe. 

Also suitable for detecting mutations in the CLH product coding sequence 
5 are oligonucleotide array methods based on sequencing by hybridization (SBH) ; 
as described, for example, in U.S. Patent No. 5.547.839. In a typical method, the 
DNA target analyte is hybridized with an array of oligonucleotides formed on a 
microchip. The sequence of the target can then be "read" from the pattern of 
target binding to the array. 

10 

D. Gene mapping utilizing nucleic acid sequences 

The nucleic~~acid sequences of the present invention are also valuable for 
chromosome identification. The sequence is specifically targeted to and can 
hybridize with a particular location on an individual human chromosome. 

15 Moreover, there is a current need for identifying particular sites on the 
chromosome. Few chromosome marking reagents based on actual sequence data 
(repeat polymorphisms) are presently available for marking chromosomal 
location. The mapping of DNAs to chromosomes according to the present 
invention is an important first step in correlating those sequences with genes 

20 associated with disease. 

Briefly, sequences can be mapped to chromosomes by preparing PCR 
primers (preferably 20-30 bp) from the CLH cDNA. Computer analysis of the 3' 
untranslated region is used to rapidly select primers that do not span more than 
one exon in the genomic DNA, which would complicate the amplification 

25 process. These primers are then used for PCR screening of somatic cell hybrids 
containing individual human chromosomes. Only those hybrids containing the 
human gene corresponding to the primer will yield an amplified fragment. 

PCR mapping of somatic cell hybrids or^sing-instead-r-adiation-hybrids 
are rapid procedures for assigning a particular DNA to a particular chromosome. 

30 Using the present invention with the same oligonucleotide primers, 



sublecaiizaiion can be achieved with panels of fragments from specific 
chromosomes or pools of large genomic clones in an analogous manner. Other 
mapping strategies that can similarly be used to map to its chromosome include in 
situ hybridization, prescreening with labeled flow-sorted chromosomes and 
preselection by hybridization to construct chromosome specific-cDNA libraries. 

Fluorescence in situ hybridization (FISH) of a cDNA clone to a metaphase 
chromosomal spread can be used to provide a precise chromosomal location in 
one step. This technique can be used with cDNA as short as 50 or 60 bases. For a 
review of this technique, see Verm a et a!., Human Chromosomes: a Manual of 
Basic Techniques. (1988) Pergamon Press. New York. 

Once a sequence has been mapped to a precise chromosomal location, the 
physical position of the sequence on the chromosome can be correlated with 
genetic map data. Such data are found, for example, in the OMIM database 
(Center for Medical Genetics. Johns Hopkins University. Baltimore, MD and 
National Center for Biotechnoloev Information. National Library of Medicine. 
Bethesda. MD). The OMIM gene map presents the cytogenetic map location of 
disease genes and other expressed genes. The OMIM database provides 
information on diseases associated with the chromosomal location. Such 
associations include the results of linkage analysis mapped to this interval, and 
the correlation of translocations and other chromosomal aberrations in this area 
with the advent of various diseases associated with abnormal amounts or function 
of BMPs. 

E. Therapeutic applications of nucleic acid sequences 

Nucleic acid sequences of the invention may also be used for therapeutic 
purposes. Turning first to the second aspect of the invention (i.e. inhibition of 
expression of CLH) ? expression of CLH product may be modulated through 
antisense technology, which controls gene expression through hybridization of 
complementary nucleic acid sequences, i.e. antisense DNA or RNA, to the 
control. 5' or regulatory regions of the gene encoding CLH product. For 



example, the 5' coding portion of the nucleic acid sequence sequence which 
codes for the product of the present invention is used to design an antisense 
oligonucleotide of from about 10 to 40 base pairs in length. Oligonucleotides 
derived from the transcription CLHt site. e.g. between positions -10 and +10 
5 from the CLHt site, are preferred. An antisense DNA oligonucleotide is designed 
to be complementary to a region of the nucleic acid sequence involved in 
transcription (Lee et al t Nucl Acids, Res., 6:3073. (1979); Cooney et al. ? Science 
241:456, (1988); and Dervan et ai, Science 251:1360. (1991)), thereby 
preventing transcription and the production of the CLH products. An antisense 
io RNA oligonucleotide hybridizes to the mRNA in vivo and blocks translation of 
the mRNA molecule into the CLH products (Okano J. Neurochem. 56:560, 
(1991)). The antisense' constructs can be delivered to cells by procedures known 
in the an such that the antisense RNA or DNA may be expressed in vivo. The 
antisense may be antisense mRNA or DNA sequence capable of coding such 
15 antisense mRNA. The antisense mRNA or the DNA coding thereof can be 
complementary to the full sequence of nucleic acid sequences coding to the CLH 
protein or to a fragment of such a sequence which is sufficient to inhibit 
production of a protein product. 

Turning now to the first aspect of the invention, i.e. expression of CLH, 
20 expression of CLH product may be increased by providing coding sequences for 
coding for said product under the control of suitable control elements ending its 
expression in the desired host. 

The nucleic acid sequences of the invention may be employed in 
combination with a suitable pharmaceutical carrier. Such compositions comprise 
25 a therapeutically effective amount of the compound, and a pharmaceutically 
acceptable carrier or excipient. Such a carrier includes but is not limited to saline, 
buffered saline, dextrose, water, glycerol, ethanol. and combinations thereof. The 

formulation should suit the mode of administration. - - 

The products of the invention as well as any activators and deactivators 
30 compounds (see below) which are polypeptides, may also be employed in 



accordance w ith the present invention by expression of such polypeptides in vivo. 
which is often referred to as "gene therapy. " Cells from a patient may be 
engineered with a nucleic acid sequence (DNA or RNA) encoding a polypeptide 
ex vivo, with the engineered cells then being provided to a patient to be treated 

5 with the polypeptide. Such methods are well-known in the art. For example, cells 
ma} - be engineered by procedures known in the art by use of a retroviral particle 
containing RNA encoding a polypeptide cf the present invention. 

Similarly, cells may be engineered in vivo for expression of a polypeptide 
in vivo by procedures known in the art. As known in the art. a producer cell for 

10 producing a retroviral particle containing RNA encoding the polypeptide of the 
present invention may be administered to a patient for engineering cells in vivo 
and expression of" the polypeptide in vivo. These and other methods for 
administering a product of the present invention by such method should be 
apparent to those skilled in the art from the teachings of the present invention. 

15 For example, the expression vehicle for engineering cells may be other than a 
retrovirus, for example, an adenovirus which may be used to engineer cells in 
vivo after combination with a suitable delivery vehicle. 

Retroviruses from which the retroviral plasmid vectors mentioned above 
may be derived include, but are not limited to, Moloney Murine Leukemia Virus, 

20 spleen necrosis virus, retroviruses such as Rous Sarcoma Virus, Harvey Sarcoma 
Virus, avian leukosis virus, gibbon ape leukemia virus, human immunodeficiency 
virus, adenovirus, Myeloproliferative Sarcoma Virus, and mammary tumor virus. 

The retroviral plasmid vector is employed to transduce packaging cell 
lines to form producer cell lines. Examples of packaging cells which may be 

25 transfected include, but are not limited to, the PE501, PAS 17, psi-2, psi-AM, 
PA 12, T19-14X, VT-19-17-H2, psi-CRE, psi-CRIP, GP^E-86, GP^-envAml2, 
and DAN cell lines as described in Miller (Human Gene Therapy, Vol. 1, pg. 
5-14. (1990)). The vector may transduce the packaging cells through any means 
known in the art. Such means include, but are not limited to, electroporation, the 

30 use of liposomes, and CaP04 precipitation. In one alternative, the retroviral 
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plasmid vector may be encapsulated into a liposome, or coupled to a lipid, and 
then administered to a host. 

The producer cell line generates infectious retroviral vector particles 
which include the nucleic acid sequence(s) encoding the polypeptides. Such 
5 retroviral vector particles then may be employed, to transduce eukaryotic cells, 
either in vitro or in vivo. The transduced eukaryotic cells will express the nucleic 
acid sequence(s) encoding the polypeptide. Eukaryotic cells which may be 
transduced include, but are not limited to. embryonic stem cells, embryonic 
carcinoma cells, as well as hematopoietic stem cells, hepatocytes, fibroblasts. 
10 myoblasts, keratinocytes, endothelial cells, and bronchial epithelial cells. 

The genes introduced into ceils may be placed under the control of 
inducible promoters; such as the radiation-inducible Egr-1 promoter. (Maceri, 
H.J., et al. f Cancer Res., 56(19):43 1 1 (1996)), to stimulate CLH production or 
antisense inhibition in response to radiation, eg., radiation therapy for treating 
15 tumors. 

Example II. CLH product 

The substantially purified CLH product of the invention has been defined 
above as the product coded from the nucleic acid sequence of the invention. 

20 Preferably the amino acid sequence is an amino acid sequence having at least 
70%, preferably at least 80% or 90% identity to the sequence identified as SEQ 
ID NO:2. The protein or polypeptide may be in mature and/or modified form, 
also as defined above. Also contemplated are protein fragments having at least 
10 contiguous amino acid residues, preferably at least 10-20 residues, derived 

25 from the CLH product. 

The sequence variations are preferably those that are considered conserved 
substitutions, as defined above. Thus, for example, a protein with a sequence 
having at least_80% sequence identity with the _protein_identified-as SEQ ID 
NO:2, preferably by utilizing conserved substitutions as defined above is also part 

30 of the invention. In a more specific embodiment, the protein has or contains the 



sequence identified SEQ ID N0:2. The CLH product may be ti ) one in which 
one or more of the amino acid residues in a sequence listed above are substituted 
with a conserved or non-conserved amino acid residue (preferably a conserved 
amino acid residue), or i ii) one in which one or more of the amino acid residues 
includes a substituent group, or (iii) one in which the CLH product is fused with 
another compound, such as a compound to increase the half-life of the protein 
(for example, polyethylene glycol (PEG)), or a moiety which serves as targeting 
means to direct the protein to its target tissue or target cell population (such as an 
antibodv). or (iv) one in which additional amino acids are fused to the CLH 
product. Such fragments, variants and derivatives are deemed to be within the 
scope of those skilled in the art from the teachings herein. 

A. Preparation cf CLH product 

Recombinant methods for producing and isolating the CLH product and 
fragments of the protein are described above. 

In addition to recombinant production, fragments and portions of CLH 
product may be produced by direct peptide synthesis using solid-phase techniques 
(cf. Stewart et a/., (1969) Solid-Phase Peptide Synthesis, WH Freeman Co, San 
Francisco; Merrifield J., J. Am. Chem. Soc, 85:2149-2154, (1963)). In vitro 
peptide synthesis may be performed using manual techniques or by automation. 
Automated svnthesis mav be achieved, for example, using Applied Biosystems 
431 A Peptide Synthesizer (Perkin Elmer, Foster City, Calif.) in accordance with 
the instructions provided by the manufacturer. Fragments of CLH product may be 
chemically synthesized separately and combined using chemical methods to 
produce the full length molecule. 

B. Therapeutic uses and compositions utilizing the CLH product 

The CLH product of the invention is generally useful in treating diseases 
and disorders which are characterized by a lower than normal level of CLH 
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expression, and or diseases which can be cured or ameliorated by raising the level 
of the CLH product, even if the level is normal. 

Typically these diseases are in CLH products or fragments and may be 
administered by any of a number of routes and methods designed to provide a 

5 consistent and predictable concentration of compound at the target organ or 
tissue. The product-containing compositions may be administered alone or in 
combination with other agents, such as stabilizing compounds, and/or in 
combination with other pharmaceutical agents such as drugs or hormones. 
^ CLH product-containing compositions may be administered by a number 

10 of routes including, but not limited to oral, intravenous, intramuscular, 
transdermal, subcutaneous, topical, sublingual, or rectal means as well as by nasal 
application. CLH product-containing compositions may also be administered via 
liposomes. Such administration routes and appropriate formulations are generally 
known to those of skill in the art. 

15 The product can be given via intravenous or intraperitoneal injection. 

Similarly, the product may be injected to other localized regions of the body. The 
product may also be administered via nasal insufflation. Enteral administration is 
also possible. For such administration, the product should be formulated into an 
appropriate capsule or elixir for oral administration, or into a suppository for 

20 rectal administration. 

The foregoing exemplary administration modes will likely require that the 
product be formulated into an appropriate carrier, including ointments, gels, 
suppositories. Appropriate formulations are well known to persons skilled in the 
art. 

25 Dosage of the product will vary, depending upon the potency and 

therapeutic index of the particular polypeptide selected. 

A therapeutic composition for use in the treatment method can include the 

product in a sterile injectable solution, the polypeptide-in-an-oral-deliv-ery. vehicle. 

the product in an aerosol suitable for nasal administration, or the product in a 
30 nebulized form, all prepared according to well known methods. Such 
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compositions comprise a therapeutically effective amount of the compound, and a 
pharmaceutically acceptable carrier or excipient. Such a carrier includes but is not 
limited to saline, buffered saline, dextrose, water, glycerol, ethanol. and 
combinations thereof. The product of the invention may also be used to modulate 
5 endothelial differentiation and proliferation as well as to modulate apoptosis 
either e?c vixo or in vitro, for example, in cell cultures. 

Example III. Screening methods for activators and deactivators (inhibitors) 

10 The present invention also includes an assay for identifying molecules, 

su^h as synthetic drugs, antibodies, peptides, or other molecules, which have a 
modulating effect on the activity of the CLH product e.g. activators or 
deactivators of the CLH product of the present invention. Such an assay 
comprises the steps of providing an CLH product encoded by the nucleic acid 

15 sequences of the present invention and determining its physiological activity on 
the target in the presence and absence of one or more candidate molecules to 
determine the candidate molecules. Those molecules which are modulating effect 
on the activity of the CLH product are selected as likely candidates for activators 
and deactivators. 

20 CLH product its catalytic or immunogenic fragments or oligopeptides 

thereof, can be used for screening therapeutic compounds in any of a variety of 
drug screening techniques. The fragment employed in such a test may be free in 
solution, affixed to a solid support, borne on a cell membrane or located 
intracellular!}'. The formation of binding complexes, between CLH product and 

25 the agent being tested, may be measured. Alternatively, the activator or 
deactivator mav work bv servina as aeonist or antagonist, respectively of the 
CLH receptor and their effect may be determined in connection with the receptor. 

Another technique for drug screening which may be used provides for 
high throughput screening of compounds having suitable binding affinity to the 

30 CLH product is described in detail by Geysen in PCT Application WO 84/03564. 
published on Sep. 13. 1984. In summary, large numbers of different small 



peptide test compounds are synthesized on a solid substrate, such as plastic pins 
or some other surface. The peptide test compounds are reacted with the full CLH 
product or with fragments of CLH product and washed. Bound CLH product is 
then detected by methods well known in the art. Substantially purified CLH 
product can also be coated directly onto plates for use in the aforementioned drug 
screening techniques. Alternatively, non-neutralizing antibodies can be used to 
capture the peptide and immobilize it on a solid support. 

Antibodies to the CLH product, as described in Example IV below, may 
also be used in screening assays according to methods well known in the art. For 
example, a "sandwich" assay may be performed, in which an anti-CLH antibody 
is affixed to a solid surface such as a microtiter plate and CLH product is added. 
Such an assay can be used to capture compounds which bind to the CLH product. 
Alternatively, such an assay may be used to measure the ability of compounds to 
influence with the binding of CLH product to the CLH receptor [ I and then 
select those compounds which effect the binding. 

Example TV. Anti-CLH antibodies 
A. Synthesis 

In still another aspect of the invention, the purified CLH product is used to 
produce anti-CLH antibodies which have diagnostic and therapeutic uses related 
to the activity, distribution, and expression of the CLH product, in particular 
therapeutic applications in modulating the effect of CLH on BMP proteins. 

Antibodies to CLH product may be generated by methods well known in 
the art. Such antibodies may include, but are not limited to, polyclonal, 
monoclonal, chimeric, humanized, single chain, Fab fragments and fragments 
produced by an Fab expression library. Antibodies, i.e., those which inhibit 
dimer formation, are especially preferred for therapeutic use. 

A fragment CLH product for antibody i nduction does not requ ire 
biological activity but have to feature immunological activity; however, the 
protein fragment or oligopeptide must be antigenic. Peptides used to induce 



specific antibodies may have an amino acid sequence consisting of at least five 
ammo acids, preferably at least 10 amino acids of the sequences specified in SEQ 
ID NO: 2. Preferably they should mimic a portion of the amino acid sequence of 
the natural protein and may contain the entire amino acid sequence of a small, 
naturally occurring molecule. Short stretches of CLH protein amino acids may be 
fused with those of another protein such as keyhole limpet hemocyanin and 
antibody produced against the chimeric molecule. Procedures well known in the 
an can be used for the production of antibodies to CLH product. 
^ For the production of antibodies, various hosts including goats, rabbits, 
rats. mice, etc may be immunized by injection with CLH product or any portion, 
fragment or oligopeptide which retains immunogenic properties. Depending on 
the host species, 'various adjuvants may be used to increase immunological 
response. Such adjuvants include but are not limited to Freund's, mineral gels 
such as aluminum hydroxide, and surface active substances such as lysolecithin, 
pluronic polyols, polyanions, peptides, oil emulsions, keyhole limpet 
hemocyanin. and dinitrophenol. BCG (bacilli Calmette-Guerin) and 
Corynebacterium parvum are potentially useful human adjuvants. 

Monoclonal antibodies to CLH protein may be prepared using any 
technique which provides for the production of antibody molecules by continuous 
ceil lines in culture. These include but are not limited to the hybridoma technique 
orisinallv described bv Koehler and Milstein (Nature 256:495-497, (1975)), the 
human B-cell hybridoma technique (Kosbor et aL, Immunol. Today 4:72, (1983); 
Cote ei aL, Proc. Natl Acad ScL 80:2026-2030, (1983)) and the EBV-hybridoma 
technique (Cole, era/., Mol Cell Biol 62:109-120, (1984)). 

Techniques developed for the production of "chimeric antibodies", the 
splicing of mouse antibody genes to human antibody genes to obtain a molecule 
with appropriate antigen specificity and biological activity can also be used 
(Morrison et aL, Proc. Natl. Acad. ScL 81:6851-6855, (1984); Neuberger et aL, 
Nature 312:604-608. (1984); Takeda et aL, Nature 314:452-454, (1985)). 
Alternatively, techniques described for the production of single chain antibodies 
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(U.S. Pat. No. 4 r 946 ? 778) can be adapted to produce single-chain antibodies 

specific for the CLH protein. 

Antibodies may also be produced by inducing in vivo production in the 

lymphocyte population or by screening recombinant immunoglobulin libraries or 
5 panels of highly specific binding reagents as disclosed in Orlandi et aL (Proc. 

Natl. Acad. ScL 86:3833-3837, 1989)), and Winter G and Milstein C. (Nature 

349:293-299, (1991)). 

Antibody fragments which contain specific binding sites for CLH protein 

mav also be generated. For example, such fragments include, but are not limited 
10 to, the F(ab')2 fragments which can be produced by pepsin digestion of the 

antibodv molecule and the Fab fragments which can be generated by reducing the 

disulfide bridges of the F(ab')2 fragments. Alternatively, Fab expression libraries 

may be constructed to allow rapid and easy identification of monoclonal Fab 

fragments with the desired specificity (Huse W.D. et aL, Science 256:1275-1281. 
15 (1989)). 

B. Diagnostic applications of antibodies 

A variety of protocols for competitive binding or immunoradiometric 
assays using either polyclonal or monoclonal antibodies with established 

20 specificities are well known in the art. Such immunoassays typically involve the 
formation of complexes between CLH product and its specific antibody and the 
measurement of complex formation. A two-site, monoclonal-based immunoassay 
utilizing monoclonal antibodies reactive to two noninterfering epitopes on a 
specific CLH product is preferred, but a competitive binding assay may also be 

25 employed. These assays are described in Maddox D.E., et aL, {J. Exp. Med. 
158:1211,(1983)). 

Antibodies which specifically bind CLH product are useful for the 
diagnosis of conditions or diseases characterized-by-ever-or-under-expression-of 
CLH. Alternatively, such antibodies may be used in assays to monitor patients 
30 being treated with CLH product, its activators, or its deactivators. Diagnostic 



— JO — 



assays for CLH protein include methods utilizing the antibody and a label to 
detect CLH product in human body fluids or extracts of cells or tissues. The 
products and antibodies of the present invention may be used with or without 
modification. Frequently, the proteins and antibodies will be labeled by joining 

5 them, either covalently or noncovalently. with a reporter molecule. A wide 
variety of reporter molecules are known in the art. 

A variety of protocols for measuring CLH product, using either polyclonal 
or monoclonal antibodies specific for the respective protein are known in the art. 
Examples include enzyme-linked immunosorbent assay (ELISA). 

10 radioimmunoassay (RIA). and fluorescent activated cell sorting (FACS). As 
noted above, a two-site, monoclonal-based immunoassay utilizing monoclonal 
antibodies reactive~to~two non- interfering epitopes on CLH product is preferred, 
but a competitive binding assay may be employed. These assays are described, 
among other places, in Maddox, et al (supra). Such protocols provide a basis for 

15 diagnosing altered or abnormal levels of CLH product expression. Normal or 
standard values for CLH product expression are established by combining body 
or cell extracts taken from normal subjects, preferably human, with antibody to 
CLH product under conditions suitable for complex formation which are well 
known in the art. The amount of standard complex formation may be quantified 

20 by various methods, preferably by photometric methods. Then, standard values 
obtained from normal samples may be compared with values obtained from 
samples from subjects potentially affected by disease. Deviation between 
standard and subject values establishes the presence of disease state. 

The antibody assays are useful to determine the level of CLH present in a 

25 body fluid sample, in order to determine whether it is being overexpressed or 
underexpressed in the tissue, or as an indication of how CLH levels are 
responding to drug treatment. 

.Another alternative is to determine the presence and/or level of naturally 
occurring anti-CLH antibodies in a sample, such as blood or serum. Many times 

30 diseases are identified by detecting the presence or level of antibodies against a 
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specific product. For the detection of such naturally occurring anti-CLH 
antibodies, the sample may be contacted with the product of the invention, for 
example as depicted in SEQ ID NO: 2. or with an antigenic fragment thereof, and 
the presence or level of antibody-antigen complexes may be determined bv 
5 methods well known in the an. 

C. Therapeutic uses of antibodies 

In addition to their diagnostic use the antibodies may have a therapeutical 
utility- in blocking or decreasing the activity of the CLH product in pathological 

10 conditions where beneficial effect can be achieved by such a decrease. 

The antibody employed is preferably a humanized monoclonal antibody, 
or a human Mab~~~pfoduced by known globulin-gene library methods. The 
antibody is administered typically as a sterile solution by IV injection, although 
other parenteral routes may be suitable. Typically, the antibody is administered 

15 in an amount between about 1-15 mg/kg body weight of the subject. Treatment is 
continued. e.g. 5 with dosing every 1-7 days, until a therapeutic improvement is 
seen. 

Although the invention has been described with reference to specific 
methods and embodiments, it is appreciated that various modifications and 
4*v 20 changes may be made without departing from the invention. 
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CLAIMS: 

1. An isolated nucleic acid sequence selected from the group consisting of: 

( i) the nucleic acid sequence depicted in SEQ ID NO: 1 : 

(ii) nucleic acid sequences having at least 70% identity with the 
5 sequence of (i); and 

(iii) fragments of (i) or (ii) of at least 20 b.p. 

2. A nucleic acid sequence according to Claim l(ii) wherein the nucleic acid 
sequences have at least 80% identity with the sequence of Claim l(i). 

3. A nucleic acid sequence according to Claim 2. wherein the nucleic acid 
io sequences have at least 90% identity. 

4. .An isolated nucleic acid sequence complementary to the nucleic acid 
sequence of Claim 1. 

5. An amino acid sequence selected from the group consisting of: 

fi) an amino acid sequence coded by the isolated nucleic acid sequence 
15 of Claim 1; 

(ii) fragments of the amino acid squence of (i) having at least 10 amino 

acids; 

(iii) analogues of the amino acid sequences of (i) or (ii) in which one or 
more amino acids has been added, deleted, replaced or chemically modified 

20 without substantially altering the biological activity of the parent amino acid 
sequence. 

6. An amino acid sequence according to Claim 5, as depicted in SEQ ED NO:2. 

7. An isolated nucleic acid sequence coding for the amino acid sequence of 
Claim 5 or 6. 

25 8. A purified antibody which binds specifically to the amino acid sequence of 
Claim 5 or 6. 
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9. An expression vector comprising the nucleic acid sequences of Claim 1 or 7 
and control elements for the expression of the nucleic acid sequence in a suitable 
host. 

10. An expression vector comprising the nucleic acid sequence of Claim 4. and 
5 control elements for the expression of the nucleic acid sequence in a suitable host. 

11. A host cell transfected by the expression vector of Claim 9 or 10. 

12. A pharmaceutical composition comprising a pharmaceutically acceptable 
carrier and as an active ingredient an agent selected from the group consisting of: 

(i) the expression vector of Claim 9; and 
io (ii) the amino acid sequence of Claim 5 or 6. 

13. A pharmaceutical composition according to Claim 12, for treatment of 
diseases which can~6e" ameliorated, cured or prevented by raising the level of the 
Chordin-Like-Homolog (CLH). 

14. A pharmaceutical composition comprising a pharmaceutically acceptable 
15 carrier and as an active ingredient an agent selected from the group consisting of: 

(i) the nucleic acid sequence of Claim 4; 

(ii) the expression vector of Claim 10; and 

(iii) the purified antibody of Claim 8. 

15. A pharmaceutical composition according to Claim 14 ? for treatment of 
20 diseases which can be ameliorated or cured by decreasing the level of the CLH 

product. 

16. A pharmaceutical composition according to Claim 12 or 14, for regulating 
the levels of bone morphogenic proteins (BMP). 

17. A pharmaceutical composition according to Claim 16 for regulating the 
25 levels of BMP-2 or BMP-4. 

18. A method for detecting an CLH nucleic acid sequence in a biological 
sample, comprising the steps of: 

(a) hybridizing to nucleic acid material— of— said— biological sample- a 

nucleic acid sequence of Claim 1 or 4; and 
30 (b) detecting said hybridization complex; 



wherein the presence of said hybridization complex correlates with the 
presence of an CLH nucleic acid seauence in the said biolosical sample. 
19. A method according to Claim 18. wherein the nucleic acid material of said 
biological sample are mRNA transcripts. 
5 20. A method according to Claim 1 8. where the nucleic acid sequence is present 
in a nucleic acid chip. 

21. A method for identifying candidate compounds capable of binding to the 
CLH product and modulating its activity the method comprising: 
~ (i) providing a protein or polypeptide comprising an amino acid 
10 sequence substantially as depicted in SEQ ID NO: 2, or a fragment of such a 
sequence; 

(ii) comparing the physiological effect of the CLH product in the 
absence and presence of said candidate compound and selecting those compounds 
which show a significant effect on said physiological activity. 
15 22. A method according to Claim 21. wherein the compound is an activator and 
the measured effect is increase in the physiological activity. 

23. A method according to Claim 21, wherein the compound is an deactivator 
and the effect is decrease in the physiological activity. 

24. An activator of the amino acid sequence of Claim 5 or 6. 

20 25. An deactivator of the amino acid sequence of Claims 5 or 6. 

26. A method for detecting CLH-product in a biological sample, comprising the 
steps of: 

(a) contacting with said biological sample the antibody of Claim 8. 
thereby forming an antibody-antigen complex; and 
25 (b) detecting said antibody-antigen complex 

wherein the presence of said antibody-antigen complex correlates with the 
presence of CLH product in said biological sample. 

27. A method for detecting anti-CLH antibodies in a biological sample 
comprising the steps of: 
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(a) contacting with said biological sample the antibody of Claim 5 or 6. 
thereby forming an antibody-antigen complex; and 

(b) detecting said antibody-antigen complex 

wherein the presence of said antibody-antigen complex correlates with the 
presence of anti-CLH antibody in said biological sample. 



For the Applicants. 

REEVHOLD COHN AND PARTNERS 
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<12C> NOVEL NUCLEIC ACII ANT AMINC AC" SEQUENCES 
<23C> 12 3 4 2 2 2 : CCMPUGEN7 ' 

< : 4 c> 
< : 4 : > 

< 1 € 0 > 2 

<170> Patent In Ver. 2.1 

<210>>-I 
<211> 12S1 
<212> DNA 
<213> HUMAN 

<400> 1 

cagggagcca caaggcctga tgtactgcct gcgctgtacc tgctcagagg gcgcccatgt 60 

gagttgttac cgcctccact ctccgcctgt ccactgcccc cagcctgtga cggagccaca 120 
gcaatgctgt cccaagtgtg tggaacctca cactccctct ggactccggg ccccaccaaa 180 

gtcctgccag cacaacggga ccatgtacca acacggacag atcttcagtg cccatgagct 240 

gttcccctcc cgcctgccca accagtgtgt cctctgcagc tgcacagagg gccagatcta 300 

ctgcggcctc acaacctgcc ccgaaccacg ctgcccaqca cccctcccgc tcccagactc 360 

ctgctgccaa gcctgcaaag atgaggcaag tgagcaatcg gatgaagagg acagtgtgca 420 

gtcgctccat ggggtgagac atcctcagga tccatgttcc agtgatgctg gcagaaagag 480 

aggcccgggc accccagccc ccactggcct cagcgcccct ctgagcttca tccctcgcca 540 

cttcagaccc aagggagcag gcagcacaac tgtcaagatc gtcctgaagg agaaacatan 600 

gaaagcctgt gtgcatggcg ggaagacgta ctcccacggg gaggtgtggc acccggcctt 660 

ccgtgccttc ggcccttgcc catgcatcct atgcacctgt gaggatggcc gccaggactg 720 

ccagcgtgtg acctgtccca cgaagtaccc ctgccgtcac cccgagaaag tggctgggaa 780 

gtgctgcaag atttgcccag aggacaaagc agaccctggc cacagtgaga tcagttctac 840 

caggtgtccc aaggcaccgg gccgggtcct cgtccacaca tcggtatccc caagcccaga 900 

caacctgcgt cgctttgccc tggaacacga ggcctcggac ttggtggaga tctacctctg 960 

gaagctggta aaagatgagg aaactgaggc tcagagaggt gaagtacctg gcccaaggcc 1020 

acacagccag aatttccact tgactcagat caagaaagtc aggaagcaag acttccagaa 1080 

agaggcacag cacttccgac tgctcgctgg cccccacgaa ggtcactgga acgtcttcct 1140 

agcccacacc ctggagctga aggtcacggc cagtccagac aaagtgacca agacataaca 1200 

aagacctaac agttgcagat atgagctgta taattgttgt tattatatat taataaataa 1260 

gaagttcycat aaccatcaaa a -o Q1 



<210> 2 
<211> 398 
<212> PRT 



1 



<213> HUMAN 



<400> 2 

Arg Glu Pro Gin 
1 

Gly Ala His Val 

20 

Pro Gin Pro Val 
35 

Pro His Thr Pro 
50 

Asn Gly Thr Met 
65 

Phe Pro Ser Arg 



Gly Gin lie Tyr 

100 

Ala Pro Leu Pro 
115 

Ala Ser Glu Gin 
130 

Val Arg His Pro 
145 

Gly Pro Gly Thr 



lie Pro Arg His 

180 

lie Val Leu Lys 
195 

Thr Tyr Ser His 
210 



Gly Leu Met Tyr 
5 

Ser Cys Tyr Arg 



Thr Glu Pro Gin 

40 

Ser Gly Leu Arg 
55 

Tyr Gin His Gly 
70 

Leu Pro Asn Gin 
85 ■ 

Cys Gly Leu Thr 



Leu Pro Asp Ser 

120 

Ser Asp Glu Glu 
135 

Gin Asp Pro Cys 
150 

Pro Ala Pro Thr 
165 

Phe Arg Pro Lys 



Glu Lys His Xaa 

200 

Gly Glu Val Trp 
215 



Cys Leu Arg Cys 
10 

Leu His Cys Pro 
25 

Gin Cys Cys Pro 



Ala Pro Pro Lys 

60 

Glu lie Phe Ser 
75 

Cys Val Leu Cys 
90 

Thr Cys Pro Glu 
105 

Cys Cys Gin Ala 



Asp Ser Val Gin 

140 

Ser Ser Asp Ala 
155 

Gly Leu Ser Ala 
170 

Gly Ala Gly Ser 
135 

Lys Ala Cys Val 



His Pro Ala Phe 

220 



Thr Cys Ser Glu 
15 

Pro Val His Cys 
30 

Lys Cys Val Glu 
45 

Ser Cys Gin His 



Ala His Glu Leu 

80 

Ser Cys Thr Glu 
95 

Pro Gly Cys Pro 
110 

Cys Lys Asp Glu 
125 

Ser Leu His Gly 



Gly Arg Lys Arg 

160 

Pro Leu Ser Phe 
175 

Thr Thr Val Lys 
190 

His Gly Gly Lys 
205 

Arg Ala Phe Gly 



Pro Cys Pro Cys lie Leu Cys Thr Cys Glu Asp Gly Arg Gin Asp Cys 
225 230 235 240 



— V S -w Y S 'w Y S ^VS 



_ c ^ 



"C 



/al leu Val His Thr Ser Val Ser Pre Se: Pre Asp Asr. Leu r-.rg ^rg 

290 295 SCO 

Phe Ala leu Glu His Glu Ala Ser Asp leu Val Glu He Tyr leu Trp 
305 310 -H 

lys I^eu Val Lys Asp Glu Glu Thr Glu Ala Gin Arg Gly Glu Val Pro 

325 330 335 



Glv Pre Arc: Pre Kis Ser Gin Asn Fhe His Leu Thr Gin He Lys Lys 



340 



345 



350 



Vci aIu jVS 

355 



Gin Asp Phe Gin Lys Glu Ala Gin His Phe Arg Leu Leu 



360 



365 



Ala Gly Pro His Glu Gly His Trp Asn Val Phe Leu Ala Gin Thr Leu 



370 



375 



380 



Glu Leu Lys Val Thr Ala Ser Pro Asp Lys Val Thr Lys Thr 
385 390 395 



3 
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